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Abstract 

We present a new approach to the proof of ergodic theorems for ac- 
tions of free groups which generalize the classical geometric covering and 
asymptotic invariance arguments used in the ergodic theory of amenable 
groups. Existing maximal and pointwise ergodic theorems for free group 
actions are extended to a large class of geometric averages which were not 
accessible by previous techniques. 
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1 Introduction 

Let G be a locally compact second countable group with Haar measure m, and 
let B t for t G N or t € M be a family Borel sets of positive finite measure. 
Let fi t be probability measures supported on B t . Suppose G acts by measure- 
preserving transformations on a probability space (X, A). For any / € i 1 (A, A) 
we may consider the averaging operator 

At [/](*) := / /GT 1 *) dMg). 

Let E[/|G] denote the conditional expectation of / with respect to the a-algebra 
of G-invariant subsets. We say that {fJ-t} is a pointwise ergodic family in L p if 
A 4 [/] converges to E[/|G] pointwise almost everywhere and in L p -norm for every 
/ G L P {X, A) and for every measure-preserving action of G on a probability 
space (X, A). 

The most useful pointwise ergodic families are those in which B t are naturally 
connected with the geometry of the group. A basic case to consider is when B t is 
the ball of radius t > with respect to an invariant metric, and /i t is the Haar- 
uniform probability measure on B t , namely the density of \Xt is \B t /iTi(B t ) . 
Such averages are referred to as ball averages. Spherical and shell averages are 
defined similarly. 

Most of the research on ergodic theorems has focused on the case when the 
group is amenable and the averages are Haar-uniformly distributed on sets which 
form an asymptotically invariant (F0lner) sequence. The covering properties of 
translates of these sets and their property of asymptotic invariance play an 
indispensable role in the arguments developed in the amenable case, and we 
refer to |Ne05] for a detailed survey of these methods and current results. 

In contrast, non-amenable groups do not admit asymptotically invariant se- 
quences, and so the arguments developed to handle amenable groups are not 
directly applicable. An alternative general approach to the ergodic theory of 
group actions based on the spectral theory of unitary representations was de- 
veloped and applied to the case where G is a semisimple iS-algebraic group, or 
a lattice subgroup of such a group. We refer to |GN10| for a detailed account 
of this theory and to |GN09] for some of its applications. Naturally, reliance 
on harmonic analysis techniques limits the scope of this theory to groups whose 
unitary representation theory can be explicated, and to their lattice subgroups. 

For general groups, and certainly for discrete groups such as (non-elementary) 
word-hyperbolic groups for example, spectral information is usually unavailable 
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and harmonic analysis techniques are usually inapplicable. Exceptions do ex- 
ist, and for example it was proven by spectral methods that ball averages with 
respect to certain invariant metrics on the free group do indeed form pointwisc 
ergodic sequences. The metrics allowed are those arising from first fixing an 
embedding of the free group as a lattice in a locally compact group. Thus in 
|Ne94) |NS94j the free group is viewed as a lattice in the group of automorphisms 
of a regular tree, and in |GN10| as a lattice in PSLzfNL), and the metric is ob- 
tained by restricting a suitable G-invariant metric to the lattice subgroup. Note 
that in the case of the tree metric a periodicity phenomenon arise, namely the 
balls form a pointwise ergodic sequence if and only if the sign character of the 
free group does not appear in the spectrum. 

In the case of the tree metric, a proof of the ergodic theorem in L log L was 
given by |BuOOI . using Markov operators (inspired by earlier related ideas in 
|Gr99| ). This method extends to groups with a Markov presentation (which 
include all hyperbolic groups) . The averaging sequences obtained are related to 
the Markov presentation rather than a metric structure on the group. 

In the present paper we develop a new approach to pointwise ergodic the- 
orems for actions of free groups, based on geometric covering and asymptotic 
invariance arguments. This approach has two significant advantages: first, it 
constitutes a direct generalization of the classical arguments employed to prove 
ergodic theorems for amenable groups, and in fact reduces the proof of er- 
godic theorems for the free group to the proof of ergodic theorems for a certain 
amenable equivalence relation. Second, as will be shown in forthcoming work, 
the new ideas extend beyond the class of free groups to word-hyperbolic groups 
|BN1| . semisimple Lie groups |BN2j . and others. 

Our goal in what follows is to explain our method in detail in the most acces- 
sible case, namely that of free groups, and show how to use it to generalize the 
existing ergodic theorems on free groups. The main results establish maximal 
inequalities and pointwise convergence for a wide class of geometrically defined 
averages not accessible by previous techniques, one simple example being sec- 
tor averages (defined below) . We also establish the integrability of the maximal 
function associated with these sequences when the original function is in L log L, 
and thus also pointwise convergence of the averages acting on functions in this 
space. 

1.1 Statement of the main theorems 

Let F = (ai, . . . , a r ) denote the free group on r generators. Let S = {a,, }[ =1 
be the associated symmetric generating set. For every nonidentity element g £ 
F, there is a unique sequence t\, . . . ,t n of elements in S such that g = t±- ■ -t n 
and n > 1 is as small as possible. Define \g\ = n. Let d¥ be the boundary of 
F which we identify with the set of all infinite sequences (s\,S2, ■ ■ ■) <E S N such 
that Si+i 7^ for all i > 1. If g = t\ ■ ■ -t n as above then the shadow of g 
(with light source at e) is the compact open set 

O g = {(ai,s 2 ,...) G d¥ : Si =U forl<i<n}. 
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The boundary admits a natural probability measure v such that v(O g ) = 
(2r)- 1 (2r-l)-l«l +1 . 

We denote the sphere of radius n in F by S n {e) = {g £ F : \g\ = n}. Let i\> be 
any probability density function on <9F; namely ip > and J dr ip dv — 1. Define 
the associated probability measures /if on if> n (e) given by H„(g) = Jo ^ 

Let F 2 < F be the subgroup generated by all elements g such that \g\ is even. 
It is a subgroup of index 2 in F. Given a probability space (X, A) on which F acts 
by measure-preserving transformations, wc let E[/|F 2 ] denote the conditional 
expectation of a function / £ L 1 (X, A) on the cr-algebra of F 2 -invariant sets. 

Theorem 1.1. Fix any continuous probability density function ip on the bound- 
ary d¥. Then in any measure-preserving action o/F on a standard probability 
space (X, A), and any f £ L P (X) for 1 < p < oo, the averages /J% n (f) £ L P (X) 
defined by 

/&(/)(*):= E /gt^mIg?) 

ges 2 „(e) 

converge pointwise almost surely and in L p '-norm to E[/|F 2 ]. Furthermore, 
pointwise convergence to the same limit holds for any f in the Orlicz space 
(LlogL)(X,A). 

Remark 1.1. In the special case in which the density is identically 1, each p,2 n is 
the uniform average on S l 2 n (e), and the theorem states that even-radius spherical 
averages converge pointwise a.e. to E[/|F 2 ], for all / G L p , 1 < p < oo and 
/ € LlogL. The proof of Theorem |1.1| is completely different and independent 
of the previous proofs of this fact in |Ne94j . |NS94j and [BuOOj . 

Remark 1.2. Given w € F, we can choose the density p w — Xo m / v {O w ) to be 
the normalized characteristic function of a basic compact open subset O w of d¥. 
Thus, the sequence [1%™ of uniform averages on the set of all words of length 
2n > \w\ with initial subword w is a pointwise ergodic sequence. It is natural 
to call these averages (in analogy with the hyperbolic plane) sector averages. 

Theorem |1.1| is a special case of a more general result, whose statement 
requires further notation. For g € F, let 5 g £ £ X (F) be the function 5 g (g') = 1 
if g = g' an( i otherwise. Let 7rg : ^ 1 (F) — > L 1 (9F, v) be the linear map 
satisfying irg(S g ) — v{O g )~ 1 \o where xo is thfi characteristic function of O g . 
If jit £ £ 1 (¥) and > then K d \fx) > and \\ir d (v)\\i = WlA\i- 

Theorem 1.2. Let {^2n}'^Li be a sequence of probability measures in ^(F) 
such that \i2n is supported on the sphere S2n(e). Let 1 < q < oo, and suppose 
{ 7r a(M2n)}^ < Li converges in L q (d¥, v). Let [X, A) be a probability space on which 
F acts by measure-preserving transformations. Lf f £ L P (X), 1 < p < oo and 
- + - < 1, then the sequence {^2n{f)}^ = i C L P {X) defined by 

H2n(f){x) ■■= E f(g~ lx )^n{g) 
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converges pointwise almost surely and in L p -norm to E[/|F 2 ]. Furthermore, if 
q = oo and {7T9(/i2n)}^Li converges uniformly, then pointwise convergence to 
the same limit holds for any f in the Orlicz space (L log L) (X, A) . 

Theorem 1 1 . 1 1 follows from Theorem |1. 2 1 To see this, fix a continuous proba- 
bility density ip on d¥. Then the associated averages (defined above) satisfy 
limn^oo Trg(fj,^) — ijj in L q (d¥, v), for all 1 < q < oo. Ind eed, the continuous 



functions 7rg(/i^) converge to ip uniformly. Hence Theorem 1.2 applies. 



There are several natural questions raised by Theorem 1.2 For example 



does the conclusion of Theorem 1.2 hold if the hypothesis that [im is supported 
on {g £ ¥ : \g\ — 2n} is weakened to the condition lim„_i. 00 /i2n(<?) — for all g € 
F? Does it hold if the inequality ~ + = < 1 is replaced by the weaker constraint 

i + ™ < 1? What if instead of being convergent in L q (d¥, v), {ird{n2n)}^\ is 
only required to be pre-compact or norm-bounded? 



1.2 On the ideas behind the proof 

To illustrate our approach, consider the following scenario. Suppose that G is 
a group and H < G is a subgroup. We say that H has the automatic ergodicity 
property if whenever G acts on a probability space (X, fi) by measure-preserving 
transformations ergodically then the action restricted to H is also ergodic. In 
this case, any pointwise ergodic sequence for if is a pointwise ergodic sequence 
for G. If H is amenable then we can use the classical theory to find such 
sequences in H. Then conjugate copies of pointwise ergodic sequences can be 
averaged to construct additional pointwise ergodic sequences supported on G. 

For example, if G = SL2 (M) then, by the Howe-Moore Theorem, any closed 
noncompact subgroup H < G has the automatic ergodicity property. In |BN2j 
we use the foregoing observation to prove pointwise ergodic theorems by averag- 
ing on conjugates of a horospherical (unipotent) subgroup, which is isomorphic 
to E. Similar considerations apply to other Lie groups as well. 

To handle free groups we will have to modify this approach by considering 
an appropriately chosen amenable "measurable subgroup" . This "subgroup" is 
a probability measure on the space of horospheres containing the identity el- 
ement. Such horospheres have an intrinsic geometric structure and a natural 
notion of asymptotically invariant (F0lner) sequence. We develop variants of the 
classical covering arguments and establish pointwise convergence for averaging 
on F0lner sequences along horospheres. The space of all horospheres containing 
the identity is identifiable with the boundary d¥. We show that this "measur- 
able subgroup" satisfies an analogue of automatic ergodicity (a more general 
result is proven in [B0O8] for all word hyperbolic groups). This is related to 
the fact that the action of F on its boundary is weakly mixing [AL05] . By av- 
eraging appropriately chosen horospherical F0lner sequences over the space of 
horospheres, we obtain that the uniform measures on spheres form a pointwise 
ergodic sequence for F. Our approach allows much more general types of aver- 
aging sequences to be analyzed similarly, since we can average the horospherical 
sequences with respect to a variety of measures on the boundary. 
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1.3 Outline of the paper 

We begin by proving ergodic theorems for equivalence relations in f|2] This 
involves a direct generalization of classical arguments. In <|3] we review the 
boundary of F, horospheres and horofunctions on F. After these preliminaries, 
we state a pointwise ergodic theorem for averages along horospheres (<Q and 
prove it using results of |2] In Sj5]wc turn to ergodicity and periodicity, and 
prove that an ergodic action of F gives rise to a 'virtually ergodic' action of the 
associated measurable subgroup. In the last section we analyze integration of 
the horospherical averages over the space of horospheres and prove Theorem 
PI 



2 An ergodic theorem for equivalence relations 

Let (B, v) be a standard Borel probability space and K(B) C B x B be a 
Borel equivalence relation (i.e., for all b,b',b" E B, (b,b) € K(B), (b,b') € 
11(B) => (b',b) e K B , 0, V), (&', b") € =► [b,b") € K(B)), with countable 

equivalence classes. Let c denote counting measure on £? (so c(E) — #E WE C 
B). The measure i/ on B is K(B) -invariant ii v x c restricted to K(B) equals 
cxi/ restricted to K(B). A Borel map 4> : B — > B is an inner automorphism of 
K(B) if it is invertible with Borel inverse and its graph is contained in K(B). 
Let Inn (72.(5)) denote the group of inner automorphisms. If v is 7?.(i?)-invariant 
then <j)*v = v for every <f> £ Jxm(K(B)). For the rest of this section, we assume 
v is a 72(-B)-invariant Borel probability measure on B. 

A basic example to keep in mind is the following special case: suppose G is 
a discrete group acting my measure-preserving transformations on (B, v). Then 
the orbit-equivalence relation K(B) := {(b,gb) : b 6 B,g £ G} is such that 
v is 7?.(-B)-invariant. In fact, a result of }FM77| implies that all probability 
measure-preserving discrete equivalence relations arise from this construction 
(up to isomorphism). 

Suppose that T = {J- n }^Li is a sequence of functions F n : B —> 2 B n (where 
2g n denotes the space of finite subsets of B) such that for each n, {(b, b') : b' G 
J- n (b)} C B x B is a Borel subset of K(B). We are concerned with several 
properties such a sequence could satisfy: 

1. A set <f> C Iim(K(B)) generates K(B) with respect to v if for v x c a.e. 
(61,62) € 7£(-B) there exists G ($) such that 0(&i) = b 2 (where ($) 
denotes the group generated by $). 

2. J 7 is asymptotically invariant (or F0lner) with respect to ^ if there exists 
a countable set $ C Inn(7?.(i?)) which generates K(B) such that 

hm I^WA^W)I =0 V0e ^_ a . e . &eB . 

3. J 7 is non-shrinking with respect to ^ if there is a constant C s > such that 
for any Borel Y <Z B and any bounded measurable function p : Y — > N we 
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have 

"(U^pivM-- y£Y})>c s u(Y). 

This property is trivially satisfied if b G J- n (b) for all n, b, which is often 
the case in practice. 

4. T satisfies the doubling condition with respect to v if there is a constant 
Cd > such that for j/-a.e. b g B and every neN 

|U{-M 6 ') : rn<n,T m (b')nF„(b)^ $}\ <C d \F n (b)\. 
For a function / on B, consider the averages A n [F; /] defined by 

1 v 6'&f„(&) 

We are interested in the convergence properties of these averages. To explain 
what the limit function could be we need a few definitions: a set E c B is 
11(B) -invariant if E x i? n ft(-B) = E x E. For a Borel function / on B, let 
E{f\lZ(B)} denote the conditional expectation of / with respect to the a-algebra 
of 7vL(-B)-invariant Borel sets and the measure v. 
The purpose of this section is to prove: 

Theorem 2.1. If J- is asymptotically invariant, non-shrinking and satisfies the 
doubling condition then J 7 is a pointwise ergodic sequence in L . I.e., for every 
f G L x (B,v), K n \T;f\ converges pointwise a.e. and in L 1 -norm to E[f\lZ(B)} 
8Sn->oo. 

This result generalizes a classical ergodic theorem for amenable groups as 
follows. Suppose 7t(B) is the orbit-equivalence relation of an amenable group 
G acting on (£?, v) by measure-preserving transformations and {i^nj^i is a 
sequence of F0lner subsets of G (so lim, woo \ KF ^ A ^\ = q for any finite K C G). 
Let T n {b) ■= {fb : f e F n }. Then T — {Fn}%Li ^ s asymptotically invariant. 
The doubling condition for {F n }'^' =1 can be stated as: | U m <„ F^F n \ < Cd\F n \. 
This implies that T is doubling. It is usually assumed in classical theorems that 
the identity element e G F„ for all n. This implies T is non-shrinking. So the 
theorem above implies an ergodic theorem for amenable groups with respect to 
a doubling F0lner sequence of sets containing the identity element. This is not 
the most general ergodic theorem known for amenable groups (see e.g., |Li01j V 



Theorem 2.1 is obtained from the next two theorems which are also proven 



in this section. 

Theorem 2.2 (Dense set of good functions). If T is asymptotically invariant 
then there exists a dense set Q C L X (B) such that for all f G Q, A n [J-;f] con- 
verges pointwise a.e. to E[/|7?-b]. Moreover, if L\(B) is the set of all functions 
f G i 1 (-B) with E[f\lZ(B)} = a.e. then there exists a dense set Q C Lq(B) 
such that for all f G Go, A n [J; /] converges pointwise a.e. to 0. 
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Theorem 2.3 (L 1 maximal inequality). Suppose that T is non-shrinking and 
satisfies the doubling condition. For f 6 L 1 (_B), let M.[J-;f] be the maximal 
function 

M[T;f] :=supA„[J-;|/|]. 

n 

Then there exists a constant C > such that for any f £ ^(B) and any t > ; 

v ({beB: M[J;/](fc)>t})<* 

Assuming the two Theorems above, we prove Theorem |2.1| 

Proof of Theorem\2^ Let / G L l (B). We will show that {A^J 7 ; /]}£° =1 con- 
verges pointwise a.e. E{f\lZ(B)]. After replacing / with / — E[f\TZ(B)] if nec- 
essary we may assume that E[/|7£(.B)] = a.e. 

For t > 0, let E t := {b e B : limsup^^ \A n [F; f](b)\ < t}. We will 



show that each E t has measure one. Let e = According to Theorem 2.2 
there exists a function /i G L 1 (B) with ||/ — /i||i < e such that {A n [.F; 
converges pointwise a.e. to 0. Let n > 0. Observe that 

|A„[J-;/]| < |A„[^;/-/i]| + |A n [^;/ 1 ]|<M[^;/-/ 1 ] + |A n [^;/i]|. 

Let 

D := {be B : M[^;/-/i](6) < v^}- 

Since A„[J*; /i] converges pointwise a.e. to zero, for a.e. b £ D there is an 
A > such that n > N implies 

|A„[J-;/](6)| < M[T;f-f 1 ](b) + \A n [T;f 1 ](b)\<2^/e = t. 

Hence D d E t (up to a set of measure zero). By Theorem |2.3| 

Ct 

v{E t ) > y{D) > 1 - Ce^Wf - /i||i > 1 - VeC = 1 - ^. 

For any s < t, B s C i? t . So v(E t ) > v(E s ) > 1 - ^ for all s < t which implies 
v(E t ) = 1. So the set E := C^ = iEx/ n has full measure. This implies pointwise 
convergence of {A„[J"; /]}£° =1 . 

The fact that A„[J r ;/] converges to E[/|7£(fl)] in L X {B) follows from the 
pointwise result. To see this, observe that it is true if / £ L°°(B) by the bounded 
convergence theorem. Since L°° is dense in L 1 and is a contraction in L 1 
this implies the result. □ 

2.1 A dense set of good functions 



In this subsection, we prove Theorem |2.2| by generalizing von Neumann's clas- 
sical argument. So assume J- is asymptotically invariant. Let $ C Iim(lZ(B)) 
be a countable set generating 1Z(B) witnessing the asymptotic invariance. 
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Lemma 2.1. Let <p e ($), / € L°°(S) and dearie /':=/-/ o 0. TTien 
A„ [J 7 ; /*'] converges pointwise a. e. to E[f'\lZ(B)]. 

Proof. First, suppose ^ e $. Because J 7 is asymptotically invariant, for a.e. 
bGB, 

lim |A n [.F;/'](6)| = lim — I— ]T /(&')-/(«) 

\T n {b)A</>(^ n (b))\ 
< 2 / oo lim =0. 

Since i/ is ^(B)-invariant, E[/|ft(5)] = E[/ o 4>\Tl{B)]. Hence E[/'|ft(5)] = 
a.e. This proves the lemma in the case cj> G 

If /" := / -/ of 1 then f" o<j> = -f. So A„[J"; /" o 0] converges as 
n — > oo to the constant pointwise a.e. Since A„ [F n ; /" — f" o <p] also converges 
as n — > oo to pointwise a.e. (by the previous paragraph), it follows that 
AnlJ 7 ; f"} converges pointwise a.e. to as n — > oo. So the lemma is true for 
<f> G 

Now suppose that the lemma is true for two functions <fii,(f>2 G Inn(lZ(B)). 
It suffices to show that if / e L°°(B) and /':=/-/ o ^ o </> 2 then ^[/'] 
converges to pointwise a.e.. This follows from the decomposition 

/' = [/-/ o fc] + [/ o & - / o X o </> 2 ] 

and the hypotheses on </>i, ^>2- D 

Lemma 2.2. Lei f be a measurable function on X such that for every 4> G 

f = f ° 4> a - e - Then f is 1Z(B) -invariant. I.e., f(b) = f{b') for a.e. {b,b') G 

1Z(B) (with respect to v x c where c is counting measure). 

Proof. For each 4> G ($) , let 

B* = {beB: /(6)^/o0(6)}. 

Since $ is countable, the group ($) is also countable and 

v{ |J B,)=0. 

By definition if b <£ U^*) B <i» then f( b ) = /(<X b )) for a11 e ($)■ But this 
implies /(&) = /(&') for i/ x c-a.e. (6, 6') € TZ(B). □ 



Proof of Theorem \2.2\ Let I C L 2 (B) be the space of 72.(£?)-invariant L 2 func- 
tions. That is, / G I if and only if /(&) = &' for a.e. (b,b') € 7?-b. Let 
Q C L 2 (B) be the space of all functions of the form / — / o <f> for / <G L°°(B) 
and </> 6 ($). We claim that the span of I and Go is dense in L 2 (B). To see 
this, let /* be a function in the orthocomplcment of Qq. Denoting the L 2 inner 
product by (•,•), we have 

0= </*,/-/°<£) = </*,/) = (/.Jl-j/.of 1 ,/) = {f.-f.o<p-\f) 
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for any / g 
/* = /* o 
invariant; i.e 



and e ($). Since L°°(S) is dense in L 2 (B), we have 
1 for all <fi 6 ($). So the previous lemma implies /* is 1Z(B)- 
, /» £ 1. This implies I and <?o span L 2 {B) as claimed. 

„ [J 7 ; /] converges pointwise a.e 



By Lemma 2.1 for every / G I 
E[f\K(B) 



Go, 



to 



Since 1 + Go is dense in L 2 (B), which is dense in L 1 (B), the first 



statement of the theorem follows. The second is similar. 



□ 



2.2 A maximal inequality 

This subsection proves Theorem |2.3| We begin with a covering lemma general- 
izing the classical Wiener covering argument. 

Lemma 2.3. Suppose T satisfies the doubling condition with constant C'd > 0. 
Let p : Y — > N be a bounded measurable function where Y C B. Then there 
exists a measurable set Z <zY such that 

1. for all z 1 ,z 2 E Z, if z x ^ z 2 then F p ^){zx) C\ F p{z2) {z 2 ) = 0: 

2. 

C d ^(\J T p(z) (z)) >^(\J T p(y) (y)). 

Proof. Let T : B — > K be an injective Borel function. We will use T to break 
'ties' in what follows. 

If Y' C Y is a Borel set then we let M(Y') be the set of all 'maximal' 
elements of Y' . Precisely, y\ 6 M(Y') if y\ € Y' and for all y 2 E Y' different 
than yi either 

L Fp(vi)(yi) n:F p{y2) (y 2 ) = 0, 

2 - p{Vi) > Pim) or 

3 - ^piv^iVi) n^jfc) ^ 0, p(Vi) = p{V2) and T(y±) > T(y 2 ). 

Because p is bounded, if Y' is non-empty then M(Y') is nonempty. 

Let Y := Y and M := M(Y Q ). Assuming that Y n ,M n C F have been 
defined, let 

Y n+1 := {yeY: T p{y) {y) n = Vz e Af„} 

and M„+i = M(Y n+1 ). Let 

Z = \jM n , Y:=\J F p{y) {y), Z := |J J" p(z) (z). 

By construction, for all Zi , z-x € 2 if z\ ^ z 2 then ^ 7 p(^ 1 )(^i) H TpU^Xpi) = 0- 
For each z g M n , let 

S(z) := U{J"p fo )(2/) : y € F„, 7" p(w) (|/) n T p{z) (z) + 0}. 
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Note that Z is the disjoint union of all the M n 's. So S is well-defined as a 
function on Z. Since each z € M n is maximal in Y n , the doubling condition 
implies 

\S(z)\ < C d \F p{z) (z)\. 

The construction of Z implies Y = U zeZ S(z). So there exists a measurable 
function J : Y — > Z such that 

• for each y € y, J(y) € -F p ( z )(z) where z £ Z is an element with y 6 <S , (^); 

• < C d for all zd. 

For example, for each z £ Z, let J z : S'(z) — > J-p( z )(z) be a map so that 
|^ 1 ( z ')l — f° r au z ' e J~p( z )(z). This family of maps can be chosen to vary 
in Borel manner with respect to z. For any y g Y define J(y) := J z (y) where 
z G Z is the unique element satisfying 

• ye S[z), 

• > T(z') for all z' G Z with y G S(z'). 

Define K : B x B — > R by z) = 1 if J(y) = z and if (y, z) = otherwise. 
Since i/ x c| K(B) = c x 

= / IJ" 1 ^)! < C d v(Z). 

J z 

This implies the lemma. □ 

Proof of Theorem \2.3[ We assume that T is non-shrinking with constant C s > 
and satisfies the doubling condition with constant C d > 0. For n > 0, let 

M„[JT;/](6) := max A t [T:\f\]{b). 

l<i<n 

Let D„ >t := {b G S : M„[J"; /](&) > i}. It suffices to show that u(D n>t ) < ^{&L 
for each n > where C > is a constant. 

Let p : D n t — > N be the function p(b) — m if m is the smallest integer such 
that A m [J; |/|](£>) > i- Let Z C D„. t be the subset given by the previous lemma 
where Y = D n>t . Let Z = \j{F p(z) {z) : z G Z} and Y = \j{F p(y] (y) : y eY}. 
The non-shrinking property of T and the previous lemma imply 

C s v{D n +) < u{Y) < C d v{Z). 

The disjointness property of Z implies that for every z G Z there exists a 
unique element ir(z) G Z with z G FpUU)) ( 7r ( 2: ))- Since Z C D n t , 

v{D ntt ) < C- x C d v(Z) < ^± J^A pMz)) [T;\f\](n(z)) dz. 
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Let K : B x B -» R be the function 

l-7>(*(z))(7r(z))l 

if z e Z and y G •7>(7r(z))( 7r ( ;2 0)- Let K(y,z) — otherwise. Since ^ x c| K ( B ) 

di/(lO = | ^K(y,z) dv{y) = | £ K(y,z) dv{z) 
z zez y ez 

= J_A piAz)) [T;\f\](n(z)) du(z). 

So 

»{D n , t ) < % [A p(7T{z)) [T; \f\}(n(z)) dz = % f \f(y)\ dy < MZUl. 
This proves the theorem with C =^f. 

3 The free group and its boundary 

Let F = (ai,...,a r ) be the free group of rank r > 2. The reduced form of an 
element g € F is the expression g — si • • • s n with € S and Sj+i 7^ s^ 1 for 
all z. It is unique. Define |g| := n, the length of the reduced form of g. The 
distance function on F is defined by d(gi,g2) ■— Iffi" 1 ^!- 



□ 



3.1 The boundary 

The boundary of F can be represented in many equivalent forms. For example, 
it is the set of all geodesic rays in T emanating from the origin. Alternatively, 
it can be described as the set of all sequences £ = (&,&,...) G S N such that 
7^ C rl ^ 0r au * — 1- We denote it be <9F. A metric da on <9F is defined 
by dd((£i,£2 ; ■ • (^i,t2, ...)) = - where n is the largest natural number such 
that £j = for all i < n. If is an y sequence of elements in F and 

g% '■= U.i • • • ti,n,i is the reduced form of gi then lim^ gi — (£1, £2, • • •) G <9F if 
is eventually equal to £j for all j. If £ € d¥ then we will write £j € S for the 
elements in £ = (£1 , £2, £3, ■ • •)• 

We define a probability measure v on 9F as follows. For every finite sequence 
ti, . . . , t n with U + i 7^ t^ 1 for 1 < i < n, let 

. . .) € : & = *i VI < i < n}) := I^Ce)]- 1 = (2r - l)-"+ 1 (2r)- 1 . 

By the Carathcodory extension Theorem, this uniquely extends to a Borel prob- 
ability measure v on d¥. 
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3.2 Horofunctions and horospheres 

There is a natural action of F on d¥ by 

(*i • • • t n )£ := (ii, • • • , t n -k,£k+i,£k+2, ■ ■ •) 

where ti,...,t n £ S, t\ ■ ■ -t n is in reduced form and fc is the largest number 
< n such that & = t n+ x-i for all i < k. Observe that if g = ti • • • t n then the 
Radon-Nikodym derivative satisfies 

For £ G <9F as above, define the function 7i£ : F — > Z by 



For example, if g = £1 • • • £ n then frf (g) = — n. More generally, if g = £1 • • • £„ti • • • t r 
is in reduced form and t\ ^ then h^(g) = m — n. Alternatively, if 

s n = €l&---£n then 

ht (g) = lim d(g,s n ) - n. 

71— >00 

The function /i£ is the horofunction associated to £. Figure ^ illustrates a 
horof unction. 

A horosphere is any level set of a horofunction. Let H ^ denote the horosphere 
:= ^(O). Note 

H, = [ 9 edF: ^(0=1 

If £ = ^2) - • •) then g G i/^ if and only if the reduced form of g is 

9 = £i£2 • • • £, n ti ■ ■ ■ t„ 

for some h, . . . ,t n £ S such that £„+i ^ h (so .g -1 ^ = . . . jff 1 ,^!, . . .)). 

is called the horosphere passing through the identity e associated to £. 

The group F acts on horofunctions by g ■ h^(f) — h^g^ 1 /) for any g,f£ 
¥,£ £ <9F. The group also acts on horospheres by 

.</•//, !'//: /• //■!• 

Observe that ii g £ then g _1 _ff^ = H g -i^ and g^ 1 • /i£ = h g -i^. More 
generally, if g € F is arbitrary then 

^ =g-he- h^g' 1 ). 

Let 7?-o(5F) be the equivalence relation on d¥ given by (£,77) £ TZo(d¥) if 
and only if there is a g £ ¥ such that g£, = i] and = 1. In other words, 

g^ 1 £ H^. By definition, v is 7\Lo(<9F)-invariant. 
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Figure 1: The "upper half space" model of the rank 2 free group. 



4 Horospherical sphere and ball averages 
4.1 Statement of main convergence result 

Suppose that F acts on a standard probability space (X, A) by measure-preserving 
transformations. Then there is a natural equivalence relation 7Zo{X x <9F) on 
X x <9F. Namely, (x,^) is 7Zo(X x OF)- equivalent to (x',£,') if there exists a 
g € such that <7 _1 a; = x' and g _1 £; = Because f is invariant under 
lZ (d¥), the product measure A x v is lZ a (X x 9F) -invariant. For a function 
/ e L X (X x d¥), let E[/|ft (X x 9F)] denote the conditional expectation of / 
on the (7-algebra of 1Zq(X x <9F)-invariant sets. This is the ergodic mean of /. 
For n > and (x, $)elx <9F, let 



S n (x,0 :={(gx,gO^Xxd¥ 



g 1 G H s , \g\ =n}. 



This is the "sphere of radius n centered at (x, £)". Note that, if n is odd and 
£ G <9F then there does not exist a g _1 € i/^ with |g| = n. So iS n (a;,£) is empty 
in this case. 

The following is proven in the next section. 

Theorem 4.1 (Pointwise convergence for horospherical sphere averages). For 
n > let A 2n [S; ■} : L X (X x <9F) -> x <9F) fee tfte operator given by 



A 2n [S;f](x,0 = 



S2n{x,0\ 



(4.1) 



(x',€')£S2n(x,|) 



Tften /or any / G L'(Ix 9F) 7 f/ie sequence {A 2 „[5; converges pointwise 

a.e. and in L 1 norm to E[f\Tl (X x OF)]. 
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By Theorem 2.1 it suffices to prove that the sequence S := {S2n}^Li is 
asymptotically invariant, non-shrinking and doubling. This is accomplished in 
the next section. 

4.2 Balls and spheres 

Definition 4.1. For g £ ¥ and n > 0, let B n (g), S n (g) C F denote the ball and 
sphere respectively of radius n centered at g (with respect to the word metric). 
For £ £ d¥ and x £ X let 

B n (0 := {gC- g- 1 £H^nB n (e)}, 
S n (0 := {.g£: f'e^ns^)}, 

5„(x,£) := {(ji^GJrxaFlff-^ffjnS^e)}. 
Our goal is to prove that 5 is is asymptotically invariant, non-shrinking and 



doubling (Proposition 4.8). However, it is easier to first prove these properties 



for B :— {B n }'^L 1 and S :— {S n }^ =1 and then transfer them to B and S. 
Lemma 4.2. Let £ = (£1,^2, . . .) £ d¥. Then for any n>0, 

B2n(£) = {(*i,*2, ■) e d¥ ; U = V 4 > n}, 

S 2n (0 = {(h,t 2 , ...)£d¥: t n ^ £ n and U = & Vt > n}. 

Proof. This is an exercise. □ 

Definition 4.3. Let ir n : d¥ — > S n be the projection map ir n ((si, s 2 , ■ ■ ■)) = 
(si, . . . , s„). We will say that a map <j> '■ d¥ — >• d¥ has order n if it is measurable 
with respect to the cr-algebra generated by the inverse images of 7r n . In other 
words, <f> has order n if for any two boundary points £, £' G <9F such that 7r„(£) = 

#o = 0(0- 

Lemma 4.4. For any (£,£') G 7vlo(<9F), there exists a map 4> £ Inn(7£o(c)F)) 
smc/i i/iai </>(£) = £' and /ias order n for some n < 00. TVius i/ie sei of finite 
order inner automorphisms of TZ$(d¥) are generating. 

Proof. If (£,£') G K Q (d¥) and £ = . . .), f = . . .) then by definition of 
72.0 (d¥), there is an n such that i > n implies £j = Let m > n and let 
/8 : S m — > 5™ be a bijection such that 

• pernio) = n m (ei 

• /3 does not change the last coordinate in the sense that if (3(si, . . . , s m ) = 

[txi • • * ) tm) then 5 m ^mi 

• /3 maps the subset {(si, . . . , s m ) : Sj_|_i ^ sf x N\ < i < m} into itself. 
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Define <j> £ lnn(TZ (d¥)) by (j>(si, s 2 , . . .) = (*i,t 2 ,...) where (3(si, . . . , s m ) = 
(ti, . . . ,t m ) and Si = ti for i > m + 1. It is easy to check that <fi is an inner 
automorphism of finite order that maps £ to □ 

Lemma 4.5. TTie sequences B := {B2n}^=i arl ^ ^ := {^2n}^Li are asymptot- 
ically invariant. 



Proof. From Lemma |4.2[ it follows that B 2n is invariant under all inner auto- 
morphisms of order < n while S 2n is invariant under all inner automorphisms 
of order < n — 1. So the previous lemma implies this lemma. □ 



2r-l 



Lemma 4.6. TTie sequences B and S are doubling with constants 1 and 2r ~ 2 
respectively. 



Proof. It follows from Lemma 4.2 that if £, £' are such that $ 2 n(£) ^^m^') ^ 
and n> m then 2?2n(£) 3 $ 2to (£')• Therefore 

IJ {B2m(0 : ™ < *W£') H B 2n (£) ^ 0} = 8^(0. 

This implies is doubling (with doubling constant 1). 
It follows from Lemma S21 that 



|52»(0I 2r 



(4.2) 



Therefore, 



< 



|B 2n (OI 2r-L 

|U {<WO : m < n,5 2m (0 n S 2n (0 ^ 0} 
|U{ S 2m(e') : m < n,B 2m (e)DB 2n (0 ^ 0} 

This proves 5 is doubling with doubling constant ■ D 
Lemma 4.7. TTie sequences B and S are non- shrinking. 

Proof. Let Y C d¥ and p : Y — > N be a bounded Borel function. Let Y B = 
\J{B P (3i)(v) : 2/ e y}. Since Y C Y B we have i/(Y) < i/(Y e ). This proves B is 
non-shrinking. 

The proof that S is non-shrinking is a little more involved. Let Y s = 
\J{Sp(y)(y) '■ V G Y}. By Lemma 2.3 there exists a Borel set Z C Y such that 

1. for all z 1 ,z 2 E Z, B p(zi) (^i) D B p ( Z2 )(z 2 ) = 0; 

2. ifZ 8 = U zGZ S p(2) (z),then 

>^(y 6 ). 
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Define F : TZ (d¥) -> K by = if f ^ Z e . Otherwise, there exists a 

unique zeZ such that £' e In this case, define = ^ ^(z)! ^ 

£ G Sp(z)( z )- Let — otherwise. Let 

zez 

Since z/ x c| K[)(aF) = c x Htc (<9F), 

- / (E*f)) *<o - / |f| mo = 



The last equality uses (4.2). By the inclusions Z s C Y s and Y C F e , we have 



v{Y s ) > v(Z s ) = 2 -^ v (Z B ) > 2 ^—^v{Y B ) > 2 ?—^v(Y). 
V J - V > 2r - 1 V ' ~ 2r-l y ' ~ 2r - I y ' 

This proves 5 is non-shrinking. 



□ 



Proposition 4.8. The sequences B := {£>2n}^Li andS :— {<§2n}^Li are asymp- 
totically invariant, non-shrinking and doubling. 

Proof. The action of F on (d¥, v) is essentially free: 

v {{£_ £ d¥ : 3 5 € F \ {e} such that g£_ = f }) = 0. 

For any inner automorphism 4> 6 Inn(7?.o((5F)), there is an inner automorphism 
(T> £ Inn(7^o(5F x X)) defined by <j){x^) — (gx,g£) where g £ F is the unique 
element such that 0(£) = g£. This is well-defined on a set of full measure. 

Let $ c Inn(72o(9F)) be a countable set that generates TZo(d¥) and let 
$ = {4> : G $}. Then for i/-a.e. £ € <9F if is ft (SF x X)-equivalent 

to there is a unique g € F with (gx,g£) — {x',£') and a <E ($) with 

0(0 = Then </>(x,£) = (£',£') so ^ i s generating. 

The projection map d¥ x X — > d¥ maps B2 n (x, £) to S 2n (£) an d 4>{B2n{x : £)) 
to_i 0(# 2l i(£)) bijectively. So the asymptotic invariance of Z?, proven in Lemma 



4.5 implies the asymptotic invariance of B. The proof that S is asymptotically 
invariant is similar. 

For any (x, £ ) £ d¥ x X, the projection map d¥ x X — > d¥ maps 

U{e 2n (z',0 : B 2n {x' ,i')C\B 2n {x,i) ± 0} M- U{S 2n (£') : S 2n (£') nB 2n (0 ^ 0} 



bijectively. So the doubling property of B, proven in Lemma 4.6 implies B is 
doubling. The proof that S is doubling is similar. 

It is obvious that B is non-shrinking. The proof that S is non-shrinking is 
similar to the proof of the previous lemma. □ 
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5 Automatic ergodicity 



Let F act on a standard probability space (X, A) by measure-preserving trans- 
formations. Let F 2 be the subgroup of F generated by words of length 2, which 
has index 2 in F. For any / G L^X), let E[/|F 2 ] G ^(X) be the conditional 
expectation of / on the er-algebra of F 2 -invariant measurable sets. 

For / G L\X), define i(f) G I>{X x d¥) by i(/)(x,f) = f(x). The map 
/ ^ *(/) isometrically embeds in L 1 (X) into L X (X x d¥). 

The purpose of this section is to prove: 

Theorem 5.1. For any f G V-{X), E[i(f)\K [X x d¥)] = i(E[/|F 2 ]). 

Similar results were proven in [Bo08] for all word hyperbolic groups. 

We remark that it is necessary to consider the action of F 2 rather than F. 
For example, if X is a two-point set, A is the uniform probability measure and 
all generators {ai, . . . , a r } of F act nontrivially on X then the action of F on X 
is ergodic but the equivalence relation 7Zq(X x d¥) on X x d¥ is not. 

In the next section we prove: 

Theorem 5.2. For any F G L l (X X d¥), E[F\¥ 2 } = E[F\K (X x d¥)} where 
E[i^|F 2 ] denotes the conditional expectation of F on the a-algebra of ¥ 2 -invariant 
sets where F 2 acts on X x d¥ diagonally: g ■ (x, £) = (gx, <?£) \/g £ ¥,x € X,£ £ 
d¥. 



Theorem 15.11 follows from the theorem above and the next lemma. 

Lemma 5.1. For any f G L l {X), E[i(/)|F 2 ] = i(E[/|F 2 ]). 

Proof. Without loss of generality, we may assume that the action of F 2 on (X, A) 
is ergodic. It suffices to show that the diagonal action F 2 rv X x d¥ is ergodic. 

Let n be the uniform measure on the generating set S. Then the action of 
F on the Poisson boundary of the random walk determined by /i is canonically 
identified with F rx (d¥,v) (e.g., see [KaOO]). Note that the support of the 
convolution /i* (j, generates F 2 . Hence the action of F 2 on the Poisson boundary 
of the random walk determined by /i * fj, is identified with the action of F 2 on 
(d¥,v). By [AL05], this action is weakly mixing. This implies the diagonal 
action of F 2 on (<9F xI,i/xA)is ergodic. □ 

5.1 Proof of automatic ergodicity 



We now turn to the proof of Theorem |5. 2 1 We begin with a few definitions and 
a reduction. 

Definition 5.2. Let X C L 1 (X x <9F) be the set of functions / that are invariant 
under the relation, i.e., for all <f> G lmi{TZo(X x d¥)), f o <fi = f. 

Let I2 C x d¥) be the set of functions / such that for all g G F 2 and 



a.e. € X x d¥, f(x,t;) = f(g(x,l;)). Theorem 5.2 is equivalent to the 

statement X = X2 . 
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Definition 5.3. For £ X x <9F, recall that £ = (£1, &,...). Define 

P(x,£) e I x W by P(x,£) = f^foO- More generally, if n > 1 then let 
P n M) := (^•■•W-Hs.O- 

Lemma 5.4. Let / G L l {X x 9F). J/ / o P 2 = / o.e. tfien / G X 2 . 

Proof. Let (x, e I x 9F and <? = t\ ■ ■ ■ t2n G F 2 be in reduced form. By 
definition, 

where k is the largest number such that = t2 n +i-i f° r a U i < k. For 
any a; G X, if k is even then {gx,g£) G P~ (2 "~ fc) P fc (>, £)• If fc is odd tnen 
(s^ffO e P~ (an_fc+1) P* +1 (a;,0. Thus if / o P 2 = / a.e. then fog = f a.e.. 
This implies the lemma. □ 



Proposition 5.5. To prove Theorem 5.2, it suffices to prove that f o P 2 = / 
for all / el. 

Proof. From the above it follows that if / o P 2 = / for all / G I then X C To 
see the reverse inclusion, let (x,£), (x',£') G X x 9F be 7£o(X x <9F)- equivalent. 
By definition, there exists g G F such that (x',£') = (gx,g&- As noted above, g 
is necessarily in F 2 . Thus if / G T% then for a.e. pair (x, £), (x', £') of 7?-o(A"x9F)- 
equivalent points of X x <9F, /(x,£) = /(x',£'), namely f £ I. This shows 
l 2 Cl □ 



The next proposition is the key geometric result in the proof of Theorem 5.2 



Define P d : d¥ -> <9F by P 9 (£) = £f Recall that d a is a distance function 
on <9F defined by GJa((£i,£2) • ■ ■), {tiit^i ■ • ■)) = where n is the largest natural 
number such that £j = i$ for all i < n. 

Proposition 5.6. Let n > 5 be an integer. Then there exist measurable maps 
: d¥ — > d¥ such that 

i. veeflF, ^(MO.Pi "(£)) = 

<?. £/ie graphs of lo and ip are contained in lZo(d¥); 

4. V£ G 9F, 3.g G F such that ipuj(£) = gw(£) and P|w(f) = 

5. V/ G L^flF), 

maxdl/owHi.H/o^K) < (2r-l) 2 ||/|| 1 
where r is the rank of the free group ¥. 

Proof. Recall that S = {a\, . . . , a r , a^" 1 , . . . , a" 1 } is the chosen generating set 
of F. Let K : S 3 — >• S 3 be a bijection so that for any (s n -l> s n, s n +i) G 6> 3 , 
if(s„_i,s„,s n+ i) = (s n _i,s^,s„+i) for some ^ {s~l 1; s„, s~+ x ). 



19 



Define ui : d¥ — > d¥ by w(si, S2, . . .) = (ti, tai ■ ■ ■) where = s$ for all i 7^ n 
and i„ = s' n where K (s n -i, s ra , s n +i) = (s n _i, s^, s n +i)- By its definition cj is 
invertible, Borel, da(£,w(£)) = ~ for any £ 6 <9F and w*^ = 2/. Moreover since cj 
does not change the tail of the sequence (i.e., because ti = for all sufficiently 
large i), the graph of to is contained in lZ (d¥). Because u> is measure-preserving, 
||/ow||i = H/lli for any f e L\d¥). 

Define V : d¥ -> d¥ by 

i)U)(a 1 ,S2, ■ ■ ■) = (S3, • • • , s„-i, s^, s" 1 , s^, s n+ i, s n+2 , ■ ■ ■) 

where K(s n -i, s n , s n +i) = (s n -i, s' n , s n +i). Because a; is invertible, ?/; is well- 
defined. 

Note that the m-th coordinate of tpuj(si, S2, ■ ■ ■) equals the m-th coordinate 
of w(si, S2, ■ ■ •) if 771 > n. Therefore, the graph of ^ is contained in d¥. If 
£ = (si,s 2 , . . .) then 

P|w£= (s 3 ,...,s n _x,s' n ,s n+ i,...). 

Thus tfe^wf, Pgwf) = i Note that P 2 a cj£ = (s 3 • • • s n _i)<(si ■ • • s^)" 1 ^ 
Similarly, -i/Wf = (53 • • • s n -i)s' n (si ■ ■ ■ s n ) u>£. This proves the fourth item. 

We claim that -0 is at most (2r — l) 2 -to-l (that is, for each b G d¥, b has at 
most (2r — l) 2 -preimages under tp). Because w is invertible, it suffices to show 
that i/jui is at most (2r — l) 2 -to-l. Suppose that (ui, U2, . . .) G d¥ and 

ipu(ux,u 2 , ■ ■ .) = <M«i, s 2 , • • .) = (s 3 , . . . , s n _i, s^, s" 1 , s^, s n+ i,s n+2 , . . .). 

By definition of tpui, Ui — Sj for i > 3. Since there are (2r — l) 2 choices for 
(ui,U2) the claim follows. 

Since the graph of ip is contained in TZ (d¥) the claim implies ||/ o -0 1 1 x < 
(2r-l) 2 ||/|| 1 for all / G L 1 ^). 

□ 

Lemma 5.7. There exist measurable maps fl n : X x 9F — > X x 9F (/or 

n = 6, 7, . . .) such that 

1. for all feL^Xx d¥), lim,^ ||/ o <&„ o O n - / o P 2 o^Hj = 0; 

2. for all feL^Xx d¥), lim™ ||/ o Q n - f\\, = 0; 

3. the graphs of $ and "J are contained in TZ (X x d¥) . 

Proof. For n > 5 an integer, let tp and u> be as in Proposition |5.6| Fix (x, £) G 
I x 9F and let 91,32 G F be such that <Ji£ = w(£) and (72$ = Define 

fln(Z)0 := (aCjfl'lO) ®n(v,Q := (fll^Si*;) and *„(x,£) := (g 2 x,g 2 £,)- 

Since the graphs of ^1 and w are contained in TZ (d¥), the graphs of <l> n and 
are contained in 1Zq(X x d¥). Let dx be a metric on X that induces its 
Borel structure and makes X into a compact space. For (x, £), (x', £') G X x d¥, 
define d*((x, £), (a/, £')) = dx(x,x') + By the previous proposition, 

&(Q„(£,0,(z,0) =da(w(0.0 = V«- Also d^ n oQ n (x^),P 2 o^ n (x,0) = 
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1/n. So if / is a continuous function on X x <9F then the bounded convergence 
theorem implies 

lim ||/o¥ n ofi n -/oP 2 o* n || 1 = 

71— tOO 

lim H/o^-ZHi = 0. 

n— > oo 

It follows from the previous proposition that the operators / i— > / o fi n , f i— > 
/ o $„ and / n- / o \ff„ are all bounded for / G x <9F) with bound 

independent of n. It easy to see that / i-» / o P 2 is also a bounded operator 
on L X (X x 9F). Since the continuous functions are dense in L X [X x <9F), this 
implies the lemma. □ 



We can now prove Theorem 5.2 



Proof of Theorem\5 . 4 By Proposition |5.5[ it suffices to show that /oP 2 = / 



for every / G I. Let <& n , fi„ (n = 6,7, . . .) be as in the previous lemma. 
Because f El and the graph of & n is contained in Hq{X x d¥), it follows that 
/ o ^> n = f for all n. An easy exercise shows that P preserves the equivalence 
relation: if (x, £) is K a (X x <9F)-equivalent to (y, £') then P(x, f ) is "K (X x d¥)- 
cquivalent to P(y, £')• It follows that / oP 2 el So / o P 2 o$„ = /oP 2 for all 
n. We now have 

ll/-/°P 2 ||l = ll/-/°P 2 °$n||l 

< ||/ - / o * n o finlli + ||/o o fi n - / o P 2 o* n ||! 
= ||/-/o^n||l + ||/o*„o^-/°P 2 o^||l. 

The previous lemma now implies / = / o P 2 as claimed. 

□ 



6 Proofs of ergodic theorems 

6.1 Applying the convergence of spherical-horospherical 
averages 

Collecting results of the previous sections, we can now prove: 

Corollary 6.1. Let F act by measure-preserving transformations on a proba- 
bility space (X, A). For / e L X (X), let E[/|F 2 ] be the conditional expectation 
of / with respect to the er-algebra of F 2 -invariant sets. Then for A x v-a,.e. 
eXxd¥, 

E[f\¥ 2 }(x)= limA 2n [5;i(/)](.,0= lim ) £ f(x'). 

1 2?n ' Wl (x',£')es 2n (x,«) 
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Proof. By Theorem |4~Tj for a.e. G X x OF, 

E[i(f)\TZ (Xxd¥)}(x,O= lim ' 



E /oo- 



ByTheorem5.1 E[i(/)|^ (Xx9F))] = i(E[/|F 2 ]). So, E[i(/)|ft (XxdF))] (a, f) 



E[/|F 2 ](a;) for a e. eIx9F. 



□ 



In the next section, we will need the following strong L p -maximal inequality. 
Recall that for f e L\X x d¥), 

M[S;f] :=supA[5;/]. 

n 

Proposition 6.2. For every p > 1 i/iere is a constant C p > smc/i i/iat /or 
every / G L P (X x 9F), ||M[<S;/]|| P < C p ||/|| p . Moreover, there is a constant C\ 
such that if f G L log + A), i/ien 



M[5;/] ^^CxH/IU^. 



Proof. It follows from Proposition 4.8 and Theorem 2.3 that for any / G L X (X x 
<9F) the weak- type (1,1) maximal inequality holds: 



A x 



/]>*}) 



< 



C\\f\\ 



for some constant C > 0. 

The first part of the proposition now follows from standard interpolation 
arguments. Namely, since the operator / H> M[<S; /] is of weak-type (1, 1) and 
is norm-bounded on L°°, it is norm-bounded in every LP , 1 < p < oo (see e.g. 
|SW71I Ch. V, Thm 2.4]). 

Finally, given the weak- type (1,1) maximal inequality, the fact that when 
/ G Llog + L(X, A), the maximal function is in fact integrable and satisfies the 
Orlicz-norm bound is standard, see e.g. |DS[ p. 678]. □ 



6.2 Averaging over the boundary 

Throughout this section we let 1 < p, q < oo be such that 1 + | = 1. Let 
ip G L q (d¥, v) be a probability density on the boundary, namely if> > and 
Jip du = 1. For / G LP(X,X), recall that i(f) G LP(X x d¥) is the function 

*(/)(*,£) = /(*)■ 

The goal of this subsection is to prove: 
Proposition 6.3. For f G LP(X, A) and n>0, define A 2n [i/j; f] G L P (X) by 



A 2n [<S; *(/)](*, £)^(£) <M0- 



T/ien A2n[V'i /] converges point-wise a.e. to E[/|F 2 ]. Furthermore, if ip is essen- 
tially bounded then the same conclusion holds for any f G Llog + L(X, A). 
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The proof of Proposition 6.3 uses the following 



Lemma 6.4. Let p,q,ip, f be as above and define M[t/>; /] := sup„ A2 ra [ip; /]. 
Then there exists a constant C p > ( depending only on p) such that for every 

feLP(x,\) 

||M[V;/]||p<G p ||V||,||/||p. 

Furthermore, there is a constant C\ > such that if ip is bounded, then for any 
f € L log + L(X, A) we have 

\M4>->f\h< CiWWMlukl. 

Proof. Let us start with the case 1 < p < oo. For a.e. x £ X, 

|M[^;/](x)| P = sup|A 2n [^;/]( 2 ;)| p 



sup 



A 2n [S;i(f)](x,Om <M0 
IIP \\M\P 



< sup \\A 2 „[S;i(f)](x, 0IIL>(9F)IMIz,<<(aFy 



The last line above is justified by Holder's inequality. Next, we observe that for 
any n > 1, 



sup||A 2n [5;i(/)](a;, OlljWaF") dA ( a; ) 

X n 



sup 



|A 2 „[5 ;i (/)](x,e)| p di/(OdA(x) 



< 



X n J3F 

M[S; »(/)](*, £) p ^(O^A(x) 



= ||M[5;t(/)]||^ (XxOT) . 
Putting this together with the previous inequality we obtain 



\\m;fW Lr 



Lp(X) 



[tP-J](x)\ p d\(x) 



X 



< ||M[5;i(/)]||J p(XxaF) ||^||J, (OT) . 



The first part of the lemma now follows from 

||M[V;/]|U n x) < ll^llL,(W)l|M[5 ;i (/)]|| iP(XxaF) 

< Cp||^||i«(SF)||»(/)||if(JfxSF) = CplMU«(91F)||/|Up(X) 



where C p > is as in Proposition 6.2 



The second part of the lemma follows in exactly the same way, taking / € 
Llog + L(X), p = 1 and g = oo above. Using the integrability of the maximal 
function and the norm bound 



< Cl ll/ll(Llogi)(X) 



together with the boundedness of ip, the desired estimate follows. 



□ 
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Proof of Proposition \6.3\ We will firs t pr ove the proposition in the special case 



in which / G L°°{X). By Corollary 6.1 A 2n [S]i(f)] converges pointwise a.e. 



to i(E[/|F 2 ]). By the Lebesgue dominated convergence theorem, this implies 
that for a.e. x G X, {A 2n {ip; /](a0}£° =1 converges to J E{f\F 2 ](x)iP(£) di/(f) = 
E[/|F 2 ](x). This finishes the case in which / G L°°(X). 

Now suppose that / G L P (X). After replacing / with /— E[/|F 2 ] if necessary, 
we may assume that E[/|F 2 ] = 0. Let e > 0. Let /' G L°°(X) be such that 
||/ - f|| p < e and E[f |F 2 ] - 0. Clearly : 

|Aan[?MI < \A 2n [^f-f}\ + \A 2n [t,f}\<M[ij-J-f} + \A 2n [^f}\. 

Since A 2n [ip; f] —>■ pointwise a.e., it follows that for a.e. x G X, 

limsup|A 2 „[^;/](z)| < Mty; f - f'](x). 

n 

The previous lemma implies: 

|| hmsup |A 2 „[^; /]||| p < ||Mfo&; f - f']\\ P < C\\f - f'\\ p < Ce. 

n 

Since e > is arbitrary, it follows that || limsup„ |A2„[f/'; /]|||p = 0. Equivalently, 
A 2n [ip; f) converges to pointwise a.e. 

The second part of the proposition follows similarly using approximation in 
the Orlicz norm. 

□ 

We can now state the following corollary, proved previously for L p , p > 1 in 
[Ne94] [NS94] and for L log+ L in [BuOO] , 

Corollary 6.5. Let p > 1 and / G L P (X) or more generally / G L\og + L(X). 
Then for a.e. x £ X, 

E[/|F 2 ](x) = lim — ip- fa' 1 *)- 
Proof. This follows from the previous proposition by setting ip = 1. □ 
6.3 Comparing averages on the boundary 

We now turn to establish that each operator / H > A 2n [V>; /] has a form similar 



to that of the operator /i 2n from Theorem 1.2 Namely, / n- A2„['!/'; /] is given 
by averaging with respect to a probability measure on F. Recall that we have 
already associated with a probability density ip on the boundary a seq uence 
of probability measures on the group, namely H 2n , from Theorem |l.l[ The 
sequence of probability measures we define now is different and will be denoted 



772„. In order to define it we need some definitions. 
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Definition 6.6. Let t\ ■ ■ ■ t^n — g be the reduced form of an element g G F 2 . 
Define 

O'(g) = 0{ti ■ ■ ■ t n ) - 0(h ■ ■ ■ t n t n+1 ) c d¥ 

where O(-) is as defined in the introduction. An elementary exercise reveals 
that £ G 0'(g) if and only if h^g) = 0. 

Define rj^ E / X (F) by 

Vt(9):= {2r _ 2){ l_ ir -,J , ( J^ 
if \g\ = 2n and otherwise. 

Lemma 6.7. For any function f G L P (X) (p > lj, any n > and any x G X, 

A 2n [^ ; /](*) = E/(<rM<(<?)- 

PVoo/. Let £ G 9F. Recall that for x E X, 

S 2n (x,0 = {GtV'T^) = 5 e ff f n S 2n (e)}. 
implies n S 2 „(e)| = (2r - 2)(2r - l)*" 1 . So, 



Lemma 



4.2 



A 2n [S; *(/)](*, = - ] - E i(/)(^,0 



(2r-2)(2r- l)™- 1 ^ ^ 9 ^' 



Thus, 



A 2 „[^;/](x) = ^ / E /Cff -1 *)^) dv{i). 

v A ; « ec %e-tf 5 ns 2 „(e) 

Since g E H^C\ S2 n (e) if and only if £ G O'(g), it follows by switching the order 
of the summation and integral above that 

(2r-2)(2r-l) ge s 2ji(e) 

= E vl(g)f(9- 1 x) = vt(f)- 

ges 2n (e) 



□ 



Recall that irg : ^(F) — > L 1 (9F,i/) is the linear map satisfying 7rg((5 s ) 
^(Off) _1 Xo 9 and if ip G i^SF, ^) then ^ G £ X (F) is the function 

JO(g) 
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if g is in the sphere S n (e). Otherwise fJ-tig) — 0- The set O(g) is the shadow of 
g (with light-source at e) defined in [1.1 

Lemma 6.8. Let ip G L q (d¥,v). Then {^d{vtn)} ! n'=i converges to ip in L q - 
norm when 1 < q < oo, and uniformly if ip is continuous. 

Proof. For n > 1, let E[-0|£„] be the conditional expectation of ip on E„, the 
cr-algebra generated by {0{g) : g G S n {e)}. Thus 

v { U g) JO(g) 

if £ G 3 with ,g e 5„(e). Note that i/(O fl ) = | (e) | ~ 1 = ( 2r)(2T . 1 „ 1 )„-i , and 

thatE[^|S n ](0=7r a (/i^)(0. 

By the martingale convergence theorem, E[^|S„] converges to ip in L q as 
n —> oo. Noting that 

7r 9 (r?t) = (2r _ 2X2^- l)n-i (l^e)!'^^] - | x (e) | ~ 1 IE I x ) ) 

convergence of ^d{ r itn) to $ m follows immediately. When ip is continu- 
ous on the boundary it is uniformly continuous and then clearly E[?/>|£ n ](£) — 
7i"d(jti„)(£) converges uniformly to ip. 

□ 

This next result is not needed for the main theorem; it seems interesting for 
its own sake. 

Proposition 6.9. As above, let 1 < p, q < oo be such that | + i = 1. Let 
/ G L p (X) /or some p' urci/j p < p' . For x € X and n > 0, define f x .m G l p (¥) 
by fx,2n{g) = f{g~ X x) if 9 € <$2n(e) and f x ,2n(g) = otherwise. Let 7r' 9 (/ x , 2 n) : 
OF — > K &e £/ie function 

■Kd(fx : 2n) ■= ^2 fx,2n(g)XO{g)- 

ges 2n (e) 

Then, for a.e. x G X , {^'g{fx,2n)}^ = i converges to the constant function £ i-> 
E[/|F 2 ](x) in the weak topology on L p (d¥,v). 

Proof. For p G D>{d¥,v) and ip G L q (dF,v), let (p,^) := / aV. It suffices 
to show that for any ip G L q (d¥, v) and a.e. x E X, (Tr' d (f x ,2n),ip) converges to 
E[/|F 2 ](a;) J ip dv. By linearity, we may assume that ip > and J ip dv = 1. 
Observe that 
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It follows from Proposition |6.3| that for a.e. 

{^d(fx,2n),^d(V2n)) = A 2nbP; f] 

converges to E[/|F 2 ](a;). It follows from the previous lemma that ip — Kdivtn) 
converges to zero in norm. Since ||7Tg(/ Xl 2n)|| involves the uniform spherical 
average of it follows from Corollary 



6_5 

E[|/|p|F 2 ](o;) 1 /p for a.e. x € X, where we a 
|/|P G Lp'/p(X) with p'/p > 1 
By Holder's inequality, 



that \\n' d (f x ^n)\\ P converges to 
so use p < p' to conclude that 



n'o(f X ,2n),i>-'Ka(vt n )) < \Wd(fx.2n)\\pU ~ ^d{rii n )\\ 



tends to zero as n — > oo. Thus equation (6.1 1 implies the proposition. □ 



Remark 6.10. Typically, Tr' g (f X} 2n) does not converge to E[/|F 2 ](a;) in norm. To 
see this, observe that ||7rg(/ X) 2 n )||p converges to E 2 [|/| p ](a;) 1 / p (for a.e. x € X). 
The norm of the constant function £ i-> E[/|F 2 ](a;) is |E[/|F 2 ](x)|. Unless / 
is constant on the ergodic component containing x, Jensen's inequality implies 
E[|/|P|F 2 ](a;) 1 /i' ^ |E[/|F 2 ](x)|. This uses p > 1. 

6.4 Proof of the main theorem 



We now turn to the proof of Theorem 1.2 whose formulation we recall for the 
reader's convenience. 

Theorem 1.2. Let {/^nl^Li &e a sequence of probability measures in £ (¥) 
such that [iin is supported on S2 n (e). Let 1 < q < oo, and suppose that 
{^di^n)}^! converges in L q (d¥, v). Let (X, A) be a probability space on which 
F acts by measure-preserving transformations. If f G L P (X), 1 < p < oo and 
i + | < 1, then the averages 

H2n(f)(x) := f(9~ 1 x)p-2n(g) 

converge pointwise almost surely and in L p '-norm to E[/|F 2 ]. Furthermore, if 
q = oo and TTg(ii2n) converge uniformly, then pointwise convergence to the same 
limit holds for any f in the Orlicz space (L\og L)(X, A). 

Proof of Theorem \l.S\ To begin, we assume 1 < q < oo. Let p' > 1 be such 
that i + i = 1. Since i + ~ < 1, it follows that p' < p. Let / E L P (X). Choose 
a measurable version E[/|F 2 ] of the conditional expectation. Let ip be the limit 
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of {TTg(fj,2n)}^=i- Let X' C X be the set of all x G X such that 

1 



E[f\F*](x) 



E /Or 1 * 

ges 2 „(e) 



(E([|/r'|F 2 ](x) 



l/p' 



Jim_A 2 „[V>;/](X) , 
1 



i/p' 



n-voo I |S , 2 „(e)| 



e i/^r 



ses 2 „(e) 



By Proposition 6.3 and Corollary 6.5 X(X') — 1. For x £ X' and n > 0, let 
f x ,2n G l p '(¥) be the function f x ,2n(g) ■= /(s -1 ^) ifg G S2 n (e) and/ x>2n (p) := 
otherwise. By Lemma |6.7| and Holder's inequality for functions on F, 



\^2n(.f){x) - A 2n [tp; f)(x)\ = j ^ f(g lx ){^n(g)-V2n(g))\ 

SGS 2 „(e) 

< ||/a:,2n||fp'(F)llM2n ~ ^fen lli«(F) • 

Recall that ir a : ^(F) -> L^SF,!/) is defined by 7r a (<5 s ) = ^(O 9 ) _1 xo( s ) 
|5 l 2n(e)|xo(g) if Iff I = 2n. It now follows that: 



llMan-f^JI^OF) = ( E lM2n(5) -f/2„(3)| 9 

ses 2 „(e) 



1/9 



E 

v ses 2n (e) 



^2n(ff) - %»(#) dv 



1/9 



< 



£ v{o g y- 1 f ka(M2„)(0-^(O(6l 9 <M£) 



1/9 



SSS' 2 „(e) 

= |5 , 2«(e)r 1/p '||7ra(^ 2 „) - 7ra( 7 72n)IU*(8f>)- 
Combining this with the previous inequality, we have 

| Man (/)(») - A 2 „[V';/](a;)| < |S2n(e)|~ 1/p ||/ X ,2«||^'( F ) ||Ta(At2«) - ^a(f?2n)IU«(OT>)- 

The definition ofX' implies |S , 2n (e)|- 1 / ?, '||/ X: 2n||^' ( F) tends to E[|/|p' IF 2 ]^) 1 ^' 
as n — ¥ oo. Lemma 
as n — >■ co. So 



6. 



implies that ||7ra(^ 2n ) - ^d{il2n)\\Li{aw, v ) tends to 



lim /i 2 „(/)(z)-A 2n [^;/](a:) =0. 
The definition of X' now implies 

lim M2n(/)(a:)=E[/|F 2 ](x). 

n— ^oo 

This proves the pointwise result if 1 < q < oo. 
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As to the case q — oo, uniform convergence of 7ra(/i2n) implies that the limit 
fun ction if) is continuous on the boundary. Therefore the second part of Lemma 



6.8 



gives the uniform convergence of ng (77, 



6.5 



Of ||7TaO»2n) - ^8(%JIU°°(@S>) to zer0 

l^a»(c)|- 1 ]|/ e> a»||/x C F) to E[|/||F 2 ](x) if / e L\ogL(X^X) 



) to ij), and t hus also the convergence 
gives the convergence of 
Using these two facts 



Corollary 

2nWI ~\\Jx,2n\\e~>-(W) 

the same arguments used above establish the desired result also in the case when 
p = p' = 1 and q = 00 provided f £ L log L(X, A). 

Finally, we note that the fact that /i2n(/) converges to E[/|F 2 ] in L p -norm 
(if p > 1) follows from the pointwise result by a standard argument (e.g., see 
the end of the proof of Theorem 4.1 ). 

□ 
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